A novel member of the Heparanase protein family
Field of the Invention 

This invention relates to newly identified polypeptides and
polynucleotides encoding such polypeptides, sometimes hereinafter
referred to as "heparanase-2", to their use in diagnosis and in identifying
compounds that may be agonists or antagonists that are potentially useful
in therapy, and to production of such polypeptides and polynucleotides.

Background of the Invention 

The drug discovery process is currently undergoing a fundamental
revolution as it embraces "functional genomics", that is, high throughput
genome- or gene-based biology. This approach as a means to identify
genes and gene products as therapeutic targets is rapidly superseding
earlier approaches based on "positional cloning". A phenotype, that is a
biological function or genetic disease, would be identified and this would
then be tracked back to the responsible gene, based on its genetic map
position.

Functional genomics relies heavily on high-throughput DNA sequencing
technologies and the various tools of bioinformatics to identify gene
sequences of potential interest from the many molecular biology databases
now available. There is a continuing need to identify and characterise
further genes and their related polypeptides/proteins, as targets for drug
discovery.



Summary of the Invention 

The present invention relates to heparanase-2, in particular
heparanase-2 polypeptides and heparanase-2 polynucleotides, recombinant
materials and methods for their production. Such polypeptides and
polynucleotides are of interest in relation to methods of treatment of
certain diseases, including, but not limited to, autoimmune disorders,
blood coagulation disorders, cancer, diabetes, ischemia, sepsis and
stroke, hereinafter referred to as "diseases of the invention". In a
further aspect, the invention relates to methods for identifying agonists
and antagonists (e.g., inhibitors) using the materials provided by the
invention, and treating conditions associated with heparanase-2 imbalance
with the identified compounds. In a still further aspect, the invention
relates to diagnostic assays for detecting diseases associated with
inappropriate heparanase-2 activity or levels.



Description of the Invention 

In a first aspect, the present invention relates to heparanase-2
polypeptides. Such polypeptides include:

(a) an isolated polypeptide encoded by a polynucleotide comprising the
sequence of SEQ ID NO:1;

(b) an isolated polypeptide comprising a polypeptide sequence having at
least 95%, 96%, 97%, 98%, or 99% identity to the polypeptide sequence
of SEQ ID NO:2;

(c) an isolated polypeptide comprising the polypeptide sequence of SEQ
ID NO:2;

(d) an isolated polypeptide having at least 95%, 96%, 97%, 98%, or 99%
identity to the polypeptide sequence of SEQ ID NO:2;

(e) the polypeptide sequence of SEQ ID NO:2; and

(f) an isolated polypeptide having or comprising a polypeptide sequence
that has an Identity Index of 0.95, 0.96, 0.97, 0.98, or 0.99 compared to
the polypeptide sequence of SEQ ID NO:2;

(g) fragments and variants of such polypeptides in (a) to (f).
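
By way of illustration only, the identity thresholds recited in (b), (d)
and (f) can be checked with a short Python sketch. It assumes the two
sequences have already been aligned to equal length (gaps written as "-")
and that percent identity is counted over all alignment columns; how
identity and the Identity Index are actually defined is not set out in
this passage, so the counting rule, the function names and the toy
sequences below are hypothetical.

```python
def percent_identity(aligned_ref: str, aligned_query: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumption: inputs are pre-aligned, equal-length strings; '-' marks a gap
    and a gap never counts as a match.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_ref:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for r, q in zip(aligned_ref, aligned_query) if r == q and r != "-"
    )
    return 100.0 * matches / len(aligned_ref)


def meets_identity(aligned_ref: str, aligned_query: str, threshold: float = 95.0) -> bool:
    """True if the pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_ref, aligned_query) >= threshold


# Hypothetical 20-residue toy sequences (not SEQ ID NO:2), differing at one position.
REFERENCE = "MKTLLVAGALLLAPAQAGSH"
VARIANT = "MKTLLVAGALLLAPAQAGSY"
```

With these toy inputs, one mismatch in twenty columns sits exactly at the
lowest recited threshold of 95%.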

Polypeptides of the present invention are believed to be members of the
endoglucuronidase family of polypeptides. Heparan sulfate proteoglycans
(HSPGs) are ubiquitous macromolecules of cell surfaces, basement
membranes and the extracellular matrix (ECM). They play a major role in
cell-cell and cell-extracellular matrix interactions. HSPGs have been
reported to bind to a variety of different molecules like growth factors
(e.g. fibroblast growth factors and platelet-derived growth factor),
cytokines (e.g. interleukin-2), extracellular matrix proteins (e.g.
fibronectin, laminin, collagen), factors involved in blood coagulation
(e.g. antithrombin III), and other proteins such as lipoproteins, DNA
topoisomerases, and β-amyloid proteins (Kjellen, L. and Lindahl, U.,
Annu. Rev. Biochem. 60, 443-475 (1991); Wight, T.N. et al., Curr. Opin.
Cell Biol. 4, 793-801 (1992)).

Binding of signalling molecules to HSPGs leads to their sequestration,
thereby creating a localised, readily accessible depot of these bound
molecules, which can easily be released upon degradation of HSPGs
(Nissen, N. et al., Biochem. J. 338, 637-642 (1999)). Additionally HSPGs
are important structural components of the ECM. In capillaries they are
found mainly in the subendothelial basement membrane, where they
support the vascular endothelium and stabilise the structure of the
capillary wall. Expression of heparan sulfate (HS)-degrading
endoglucuronidases, commonly called 'heparanases' (Nakajima, M. et
al., J. Biol. Chem. 259, 2283-2290 (1984)), is found in blood borne cells
and placental trophoblasts, reflecting their requirement for cell
diapedesis activity associated with inflammatory processes or wound
healing, and pregnancy, respectively (Vlodavsky, I. et al., Invasion
Metastasis 12, 112-127 (1992); Goshen, R. et al., Mol. Hum. Reprod. 2,
679-684 (1996)).

Degradation of the HS moieties of HSPGs affects a great variety of
biological processes. Of particular interest is the proposed function of
heparanases in neoangiogenesis and metastasis, associated with
malignant tumours. It has been shown that secreted heparanase activity
induces endothelial mitogenesis (Folkman, J. et al., Am. J. Pathol. 13,
393-400 (1988); Ishai-Michaeli, R. et al., Cell Regul. 1, 833-842 (1990))
and correlates directly with the metastatic property of a number of
human metastatic cell lines as well as specimens of human breast, colon
and liver carcinomas (Nakajima, M. et al., Science 220, 611-613 (1983);
Vlodavsky, I. et al., Cancer Res. 43, 2704-2711 (1983)).

While it is commonly acknowledged that heparanases play a crucial role
in many important biological processes, it is not at all trivial to isolate
this enzymatic activity, mainly because of instability of the enzymes and
the difficulty in designing specific quantitative assays (Freeman, C. and
Parish, C.R., Biochem. J. 325, 229-237 (1997)). Biochemical experiments
have so far identified three distinct groups of heparanase activities, with
molecular weights of 137 kDa (Oosta, G.M. et al., J. Biol. Chem. 257,
11249-11255 (1982)), 50 kDa (Freeman, C. and Parish, C.R., Biochem.
J. 330, 1341-1350 (1998)), and 32-40 kDa (Hoogewerf, A.J. et al., J.
Biol. Chem. 270, 3268-3277 (1995)). Now, three independent groups
report the successful cloning of the first mammalian heparanase gene
representing the 50 kDa class (Vlodavsky, I. et al., Nat. Medicine 5,
793-802 (1999); Hulett, M.D. et al., Nat. Medicine 5, 803-809 (1999);
Toyoshima, M. and Nakajima, M., J. Biol. Chem. 270, 24153-24160
(1999)). For the first time it is now possible to analyse this so far
elusive enzyme class at the molecular level. RNA in situs show that this
heparanase gene is preferentially expressed in highly metastatic mouse
and human cell lines and in biopsy specimens of human tumours.
Moreover, increased levels of heparanase were detected in sera
(Nakajima, M. et al., Science 220, 611-613 (1983)) and urine of
metastatic tumour-bearing animals and cancer patients (Vlodavsky, I. et
al., Nat. Medicine 5, 793-802 (1999)). Transfection of low or
non-metastatic tumour cell lines with the heparanase gene confers a high
metastatic potential in experimental mice, resulting in an increased rate
of mortality. On the contrary, treatment of experimental animals with
heparanase inhibitors (e.g., non-anticoagulant species of
low-molecular-weight heparin and polysulfated saccharides) considerably
reduces the incidence of lung metastases by melanoma, Lewis lung
carcinoma and mammary adenocarcinoma cells (Vlodavsky, I. et al.,
Invasion Metastasis 14, 290-302 (1995); Parish, C.R. et al., Int. J.
Cancer 40, 511-517 (1987)).

As it is generally accepted that heparanase activity plays a crucial role in
many distinct biological processes, there is a clear need to identify further
members of this protein family. This will help to develop tools for early
detection and treatment of aberrant signalling processes, which are
involved in various diseases such as (but not limited to) cancer
metastasis, autoimmune disorders, or diabetes. While the first member of
the 50 kDa class of heparanases has now been cloned, no further
heparanase gene has been identified so far. The heparanase activity
represented by the 32-40 kDa class was proposed to consist of a
tetramer of four subunits each closely related to CXC chemokines
(CTAP-III/NAP-2). However, the enzymatic activity of this complex results
in the generation of disaccharides (Hoogewerf et al., J. Biol. Chem.
270, 3268-3277 (1995)), while typical heparanase activity generates
characteristic large molecular weight fragments during the degradation of
the HS substrate (Toyoshima, M. and Nakajima, M., J. Biol. Chem. 270,
24153-24160 (1999)). Here we report the identification of a novel
member of the heparanase protein family. It is represented by a single
EST partially encoding a novel protein (heparanase-2) closely related to
the 50 kDa heparanase gene.

The biological properties of the heparanase-2 are hereinafter referred to
as "biological activity of heparanase-2" or "heparanase-2 activity".
Preferably, a polypeptide of the present invention exhibits at least one
biological activity of heparanase-2.

Polypeptides of the present invention also include variants of the
aforementioned polypeptides, including all allelic forms and splice variants.
Such polypeptides vary from the reference polypeptide by insertions,
deletions, and substitutions that may be conservative or non-conservative,
or any combination thereof. Particularly preferred variants are those in
which several, for instance from 50 to 30, from 30 to 20, from 20 to 10, from
10 to 5, from 5 to 3, from 3 to 2, from 2 to 1 or 1 amino acids are inserted,
substituted, or deleted, in any combination.

Preferred fragments of polypeptides of the present invention include an
isolated polypeptide comprising an amino acid sequence having at least
30, 50 or 100 contiguous amino acids from the amino acid sequence of
SEQ ID NO: 2, or an isolated polypeptide comprising an amino acid
sequence having at least 30, 50 or 100 contiguous amino acids truncated
or deleted from the amino acid sequence of SEQ ID NO: 2. Preferred
fragments are biologically active fragments that mediate the biological
activity of heparanase-2, including those with a similar activity or an
improved activity, or with a decreased undesirable activity. Also preferred
are those fragments that are antigenic or immunogenic in an animal,
especially in a human.
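
The two kinds of fragment described above (a stretch of contiguous
residues taken from the reference, and the reference with a contiguous
stretch removed) can be pictured with the following Python sketch. The
function names and the short toy sequence are hypothetical and stand in
for SEQ ID NO: 2, which is not reproduced here; the default minimum
length of 30 is the smallest value recited in the text.

```python
def is_contiguous_fragment(candidate: str, reference: str, min_length: int = 30) -> bool:
    """True if `candidate` is an uninterrupted stretch of `reference`
    that is at least `min_length` residues long."""
    return len(candidate) >= min_length and candidate in reference


def delete_segment(reference: str, start: int, length: int) -> str:
    """Return `reference` with `length` contiguous residues removed,
    beginning at 0-based index `start` (a truncation if the segment
    touches either end of the sequence)."""
    if start < 0 or length < 0 or start + length > len(reference):
        raise ValueError("segment lies outside the reference sequence")
    return reference[:start] + reference[start + length:]


# Hypothetical 17-residue toy sequence used only to exercise the functions.
TOY_REFERENCE = "MKTLLVAGALLLAPAQA"
```

For a real sequence the same calls would be made with min_length set to
30, 50 or 100 as recited above.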

Fragments of the polypeptides of the invention may be employed for
producing the corresponding full-length polypeptide by peptide synthesis;
therefore, these variants may be employed as intermediates for
producing the full-length polypeptides of the invention. The polypeptides of
the present invention may be in the form of the "mature" protein or may
be a part of a larger protein such as a precursor or a fusion protein. It is
often advantageous to include an additional amino acid sequence that
contains secretory or leader sequences, pro-sequences, sequences that
aid in purification, for instance multiple histidine residues, or an additional
sequence for stability during recombinant production.

Polypeptides of the present invention can be prepared in any suitable
manner, for instance by isolation from naturally occurring sources, from
genetically engineered host cells comprising expression systems (vide
infra) or by chemical synthesis, using for instance automated peptide
synthesisers, or a combination of such methods. Means for preparing such
polypeptides are well understood in the art.






In a further aspect, the present invention relates to heparanase-2
polynucleotides. Such polynucleotides include:

(a) an isolated polynucleotide comprising a polynucleotide sequence
having at least 95%, 96%, 97%, 98%, or 99% identity to the
polynucleotide sequence of SEQ ID NO:1;

(b) an isolated polynucleotide comprising the polynucleotide of SEQ ID
NO:1;

(c) an isolated polynucleotide having at least 95%, 96%, 97%, 98%, or
99% identity to the polynucleotide of SEQ ID NO:1;

(d) the isolated polynucleotide of SEQ ID NO:1;

(e) an isolated polynucleotide comprising a polynucleotide sequence
encoding a polypeptide sequence having at least 95%, 96%, 97%, 98%, or
99% identity to the polypeptide sequence of SEQ ID NO:2;

(f) an isolated polynucleotide comprising a polynucleotide sequence
encoding the polypeptide of SEQ ID NO:2;

(g) an isolated polynucleotide having a polynucleotide sequence encoding
a polypeptide sequence having at least 95%, 96%, 97%, 98%, or 99%
identity to the polypeptide sequence of SEQ ID NO:2;

(h) an isolated polynucleotide encoding the polypeptide of SEQ ID NO:2;

(i) an isolated polynucleotide having or comprising a polynucleotide
sequence that has an Identity Index of 0.95, 0.96, 0.97, 0.98, or 0.99
compared to the polynucleotide sequence of SEQ ID NO:1;

(j) an isolated polynucleotide having or comprising a polynucleotide
sequence encoding a polypeptide sequence that has an Identity Index of
0.95, 0.96, 0.97, 0.98, or 0.99 compared to the polypeptide sequence of
SEQ ID NO:2; and

polynucleotides that are fragments and variants of the above mentioned
polynucleotides or that are complementary to above mentioned
polynucleotides, over the entire length thereof.

Preferred fragments of polynucleotides of the present invention include an
isolated polynucleotide comprising a nucleotide sequence having at least
15, 30, 50 or 100 contiguous nucleotides from the sequence of SEQ ID
NO: 1, or an isolated polynucleotide comprising a sequence having at
least 30, 50 or 100 contiguous nucleotides truncated or deleted from the
sequence of SEQ ID NO: 1.

Preferred variants of polynucleotides of the present invention include
splice variants, allelic variants, and polymorphisms, including
polynucleotides having one or more single nucleotide polymorphisms
(SNPs).

Polynucleotides of the present invention also include polynucleotides
encoding polypeptide variants that comprise the amino acid sequence of
SEQ ID NO:2 and in which several, for instance from 50 to 30, from 30 to
20, from 20 to 10, from 10 to 5, from 5 to 3, from 3 to 2, from 2 to 1 or 1
amino acid residues are substituted, deleted or added, in any combination.

In a further aspect, the present invention provides polynucleotides that
are RNA transcripts of the DNA sequences of the present invention.
Accordingly, there is provided an RNA polynucleotide that:

(a) comprises an RNA transcript of the DNA sequence encoding the
polypeptide of SEQ ID NO:2;

(b) is the RNA transcript of the DNA sequence encoding the polypeptide
of SEQ ID NO:2;

(c) comprises an RNA transcript of the DNA sequence of SEQ ID NO:1;
or

(d) is the RNA transcript of the DNA sequence of SEQ ID NO:1;
and RNA polynucleotides that are complementary thereto.
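
As a purely illustrative sketch, the relation between a DNA sequence, its
RNA transcript and the complementary RNA can be written in Python as
follows. It assumes the DNA is given as its coding (sense) strand, so
that the transcript is obtained by replacing T with U; the function names
and the six-base example are hypothetical and are not taken from SEQ ID
NO:1.

```python
# Translation tables for the two string operations below.
_DNA_TO_RNA = str.maketrans("Tt", "Uu")
_RNA_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def transcribe(coding_strand: str) -> str:
    """RNA transcript of a DNA coding strand (T replaced by U)."""
    return coding_strand.translate(_DNA_TO_RNA)


def rna_reverse_complement(rna: str) -> str:
    """RNA strand complementary to `rna`, written 5' to 3'."""
    return rna.translate(_RNA_COMPLEMENT)[::-1]


# Hypothetical six-base example.
EXAMPLE_DNA = "ATGGCT"
EXAMPLE_RNA = transcribe(EXAMPLE_DNA)
EXAMPLE_COMPLEMENT = rna_reverse_complement(EXAMPLE_RNA)
```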



The polynucleotide sequence of SEQ ID NO:1 shows homology with
AF144325 (Vlodavsky, I. et al., Nat. Medicine 5, 793-802 (1999)). The
polynucleotide sequence of SEQ ID NO:1 is a cDNA sequence that
encodes the polypeptide of SEQ ID NO:2. The polynucleotide sequence
encoding the polypeptide of SEQ ID NO:2 may be identical to the
polypeptide encoding sequence of SEQ ID NO:1 or it may be a
sequence other than SEQ ID NO:1, which, as a result of the redundancy
(degeneracy) of the genetic code, also encodes the polypeptide of SEQ
ID NO:2. The polypeptide of SEQ ID NO:2 is related to other proteins
of the endoglucuronidase family, having homology and/or structural
similarity with AAD42342 (Vlodavsky, I. et al., Nat. Medicine 5,
793-802 (1999)).
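
The redundancy (degeneracy) of the genetic code referred to above can be
illustrated with a minimal Python sketch using the standard genetic code.
The two nine-base sequences are hypothetical toy examples, not parts of
SEQ ID NO:1; they differ at the nucleotide level yet translate to the
same three-residue peptide.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order for each
# of the three positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate a DNA coding sequence, read in frame from its first base."""
    coding_sequence = coding_sequence.upper()
    if len(coding_sequence) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    return "".join(
        CODON_TABLE[coding_sequence[i:i + 3]]
        for i in range(0, len(coding_sequence), 3)
    )


# Two hypothetical coding sequences that differ in their second and third codons.
SEQUENCE_A = "ATGGCTCTG"
SEQUENCE_B = "ATGGCCTTA"
```

Both toy sequences translate to the peptide Met-Ala-Leu, which is the
sense in which a sequence other than SEQ ID NO:1 can still encode the
polypeptide of SEQ ID NO:2.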

Preferred polypeptides and polynucleotides of the present invention are
expected to have, inter alia, similar biological functions/properties to their
homologous polypeptides and polynucleotides. Furthermore, preferred
polypeptides and polynucleotides of the present invention have at least one
heparanase-2 activity.



Polynucleotides of the present invention may be obtained using standard
cloning and screening techniques from a cDNA library derived from mRNA
in cells of human bladder (see for instance, Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y. (1989)). Polynucleotides of the invention
can also be obtained from natural sources such as genomic DNA libraries
or can be synthesized using well known and commercially available
techniques.



When polynucleotides of the present invention are used for the
recombinant production of polypeptides of the present invention, the
polynucleotide may include the coding sequence for the mature
polypeptide, by itself, or the coding sequence for the mature polypeptide in
reading frame with other coding sequences, such as those encoding a
leader or secretory sequence, a pre-, or pro- or prepro- protein sequence,
or other fusion peptide portions. For example, a marker sequence that
facilitates purification of the fused polypeptide can be encoded. In certain
preferred embodiments of this aspect of the invention, the marker sequence
is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.)
and described in Gentz et al., Proc Natl Acad Sci USA (1989) 86:821-824,
or is an HA tag. The polynucleotide may also contain non-coding 5' and 3'
sequences, such as transcribed, non-translated sequences, splicing and
polyadenylation signals, ribosome binding sites and sequences that
stabilize mRNA.

Polynucleotides that are identical, or have sufficient identity to a
polynucleotide sequence of SEQ ID NO:1, may be used as hybridization
probes for cDNA and genomic DNA or as primers for a nucleic acid
amplification reaction (for instance, PCR). Such probes and primers may
be used to isolate full-length cDNAs and genomic clones encoding
polypeptides of the present invention and to isolate cDNA and genomic
clones of other genes (including genes encoding paralogs from human
sources and orthologs and paralogs from species other than human) that
have a high sequence similarity to SEQ ID NO:1, typically at least 95%
identity. Preferred probes and primers will generally comprise at least 15
nucleotides, preferably at least 30 nucleotides, and may have at least 50, if
not at least 100 nucleotides. Particularly preferred probes will have
between 30 and 50 nucleotides. Particularly preferred primers will have
between 20 and 25 nucleotides.

A polynucleotide encoding a polypeptide of the present invention, including
homologs from species other than human, may be obtained by a process
comprising the steps of screening a library under stringent hybridization
conditions with a labeled probe having the sequence of SEQ ID NO: 1 or a
fragment thereof, preferably of at least 15 nucleotides; and isolating
full-length cDNA and genomic clones containing said polynucleotide
sequence. Such hybridization techniques are well known to the skilled
artisan. Preferred stringent hybridization conditions include overnight
incubation at 42°C in a solution comprising: 50% formamide, 5x SSC
(150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate
(pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20
microgram/ml denatured, sheared salmon sperm DNA; followed by
washing the filters in 0.1x SSC at about 65°C. Thus the present invention
also includes isolated polynucleotides, preferably with a nucleotide
sequence of at least 100, obtained by screening a library under stringent
hybridization conditions with a labeled probe having the sequence of SEQ
ID NO:1 or a fragment thereof, preferably of at least 15 nucleotides.
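
For orientation, the salt concentrations implied by the SSC strengths
named above can be worked out with a small Python sketch. It rests on one
assumption that should be stated plainly: the parenthetical "150 mM NaCl,
15 mM trisodium citrate" is read here as the composition of 1x SSC, which
is the conventional definition, so that other strengths are simple
multiples. The constant and function names are hypothetical.

```python
# Assumed composition of 1x SSC in millimolar (conventional definition).
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(fold: float) -> dict:
    """Component concentrations (mM) of an SSC buffer at the given strength,
    e.g. fold=5 for the hybridization solution, fold=0.1 for the wash."""
    if fold <= 0:
        raise ValueError("buffer strength must be positive")
    return {name: conc * fold for name, conc in SSC_1X_MM.items()}


HYBRIDIZATION_SSC = ssc_concentrations(5)    # 5x SSC used during incubation
WASH_SSC = ssc_concentrations(0.1)           # 0.1x SSC used for the wash
```

Under that reading the stringent wash is carried out at a fiftieth of the
salt concentration of the hybridization solution, in addition to the
higher temperature.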

The skilled artisan will appreciate that, in many cases, an isolated cDNA
sequence will be incomplete, in that the region coding for the polypeptide
does not extend all the way through to the 5' terminus. This is a
consequence of reverse transcriptase, an enzyme with inherently low
"processivity" (a measure of the ability of the enzyme to remain attached
to the template during the polymerisation reaction), failing to complete a
DNA copy of the mRNA template during first strand cDNA synthesis.

There are several methods available and well known to those skilled in
the art to obtain full-length cDNAs, or extend short cDNAs, for example
those based on the method of Rapid Amplification of cDNA ends (RACE)
(see, for example, Frohman et al., Proc Nat Acad Sci USA 85, 8998-
9002, 1988). Recent modifications of the technique, exemplified by the
Marathon (trade mark) technology (Clontech Laboratories Inc.) for
example, have significantly simplified the search for longer cDNAs. In the
Marathon (trade mark) technology, cDNAs have been prepared from
mRNA extracted from a chosen tissue and an 'adaptor' sequence ligated
onto each end. Nucleic acid amplification (PCR) is then carried out to
amplify the "missing" 5' end of the cDNA using a combination of gene
specific and adaptor specific oligonucleotide primers. The PCR reaction
is then repeated using 'nested' primers, that is, primers designed to
anneal within the amplified product (typically an adaptor specific primer
that anneals further 3' in the adaptor sequence and a gene specific
primer that anneals further 5' in the known gene sequence). The
products of this reaction can then be analysed by DNA sequencing and a
full-length cDNA constructed either by joining the product directly to the
existing cDNA to give a complete sequence, or carrying out a separate
full-length PCR using the new sequence information for the design of the
5' primer.



Recombinant polypeptides of the present invention may be prepared by
processes well known in the art from genetically engineered host cells
comprising expression systems. Accordingly, in a further aspect, the
present invention relates to expression systems comprising a
polynucleotide or polynucleotides of the present invention, to host cells
which are genetically engineered with such expression systems and to the
production of polypeptides of the invention by recombinant techniques.
Cell-free translation systems can also be employed to produce such
proteins using RNAs derived from the DNA constructs of the present
invention.

For recombinant production, host cells can be genetically engineered to
incorporate expression systems or portions thereof for polynucleotides of
the present invention. Polynucleotides may be introduced into host cells by
methods described in many standard laboratory manuals, such as Davis et
al., Basic Methods in Molecular Biology (1986) and Sambrook et al. (ibid).
Preferred methods of introducing polynucleotides into host cells include, for
instance, calcium phosphate transfection, DEAE-dextran mediated
transfection, transvection, microinjection, cationic lipid-mediated
transfection, electroporation, transduction, scrape loading, ballistic
introduction or infection.

Representative examples of appropriate hosts include bacterial cells, such
as Streptococci, Staphylococci, E. coli, Streptomyces and Bacillus subtilis
cells; fungal cells, such as yeast cells and Aspergillus cells; insect cells
such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as
CHO, COS, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma cells;
and plant cells.

A great variety of expression systems can be used, for instance,
chromosomal, episomal and virus-derived systems, e.g., vectors derived
from bacterial plasmids, from bacteriophage, from transposons, from yeast
episomes, from insertion elements, from yeast chromosomal elements,
from viruses such as baculoviruses, papova viruses, such as SV40,
vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and
retroviruses, and vectors derived from combinations thereof, such as those
derived from plasmid and bacteriophage genetic elements, such as
cosmids and phagemids. The expression systems may contain control
regions that regulate as well as engender expression. Generally, any
system or vector that is able to maintain, propagate or express a
polynucleotide to produce a polypeptide in a host may be used. The
appropriate polynucleotide sequence may be inserted into an expression
system by any of a variety of well-known and routine techniques, such as,
for example, those set forth in Sambrook et al. (ibid). Appropriate secretion
signals may be incorporated into the desired polypeptide to allow secretion
of the translated protein into the lumen of the endoplasmic reticulum, the
periplasmic space or the extracellular environment. These signals may be
endogenous to the polypeptide or they may be heterologous signals.

If a polypeptide of the present invention is to be expressed for use in
screening assays, it is generally preferred that the polypeptide be
produced at the surface of the cell. In this event, the cells may be
harvested prior to use in the screening assay. If the polypeptide is
secreted into the medium, the medium can be recovered in order to
recover and purify the polypeptide. If produced intracellularly, the cells
must first be lysed before the polypeptide is recovered.

Polypeptides of the present invention can be recovered and purified from
recombinant cell cultures by well-known methods including ammonium
sulfate or ethanol precipitation, acid extraction, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite
chromatography and lectin chromatography. Most preferably, high
performance liquid chromatography is employed for purification. Well
known techniques for refolding proteins may be employed to regenerate
active conformation when the polypeptide is denatured during intracellular
synthesis, isolation and/or purification.

Polynucleotides of the present invention may be used as diagnostic
reagents, through detecting mutations in the associated gene. Detection of
a mutated form of the gene characterised by the polynucleotide of SEQ ID
NO:1 in the cDNA or genomic sequence and which is associated with a
dysfunction will provide a diagnostic tool that can add to, or define, a
diagnosis of a disease, or susceptibility to a disease, which results from
under-expression, over-expression or altered spatial or temporal expression
of the gene. Individuals carrying mutations in the gene may be detected at
the DNA level by a variety of techniques well known in the art.






Nucleic acids for diagnosis may be obtained from a subject's cells, such as
from blood, urine, saliva, tissue biopsy or autopsy material. The genomic
DNA may be used directly for detection or it may be amplified enzymatically
by using PCR, preferably RT-PCR, or other amplification techniques prior to
analysis. RNA or cDNA may also be used in similar fashion. Deletions and
insertions can be detected by a change in size of the amplified product in
comparison to the normal genotype. Point mutations can be identified by
hybridizing amplified DNA to labeled heparanase-2 nucleotide sequences.
Perfectly matched sequences can be distinguished from mismatched
duplexes by RNase digestion or by differences in melting temperatures.
DNA sequence difference may also be detected by alterations in the
electrophoretic mobility of DNA fragments in gels, with or without
denaturing agents, or by direct DNA sequencing (see, for instance, Myers
et al., Science (1985) 230:1242). Sequence changes at specific locations
may also be revealed by nuclease protection assays, such as RNase and
S1 protection or the chemical cleavage method (see Cotton et al., Proc Natl
Acad Sci USA (1985) 86: 4397-4401).
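
The logic of the comparison described above, once sequence data are in
hand, can be sketched in Python as follows: a change in amplicon length
points to a deletion or insertion, while equal-length products are
compared base by base for point differences. This is a simplified
illustration only; the function name, the result layout and the toy
amplicons are hypothetical, and no alignment is attempted when the
lengths differ.

```python
def compare_amplicons(reference: str, sample: str) -> dict:
    """Classify a sample amplicon against the normal (reference) amplicon.

    Returns a dict with:
      kind        - 'deletion', 'insertion', 'point_mutation' or 'identical'
      size_change - len(sample) - len(reference)
      mismatches  - list of (1-based position, reference base, sample base),
                    filled only when the two lengths are equal
    """
    size_change = len(sample) - len(reference)
    if size_change < 0:
        return {"kind": "deletion", "size_change": size_change, "mismatches": []}
    if size_change > 0:
        return {"kind": "insertion", "size_change": size_change, "mismatches": []}
    mismatches = [
        (position, ref_base, sample_base)
        for position, (ref_base, sample_base) in enumerate(zip(reference, sample), start=1)
        if ref_base != sample_base
    ]
    kind = "point_mutation" if mismatches else "identical"
    return {"kind": kind, "size_change": 0, "mismatches": mismatches}


# Hypothetical eight-base amplicons used only to exercise the function.
NORMAL = "ACGTACGT"
POINT_VARIANT = "ACGTTCGT"
SHORT_VARIANT = "ACGTGT"
```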

An array of oligonucleotide probes comprising heparanase-2
polynucleotide sequence or fragments thereof can be constructed to
conduct efficient screening of, e.g., genetic mutations. Such arrays are
preferably high density arrays or grids. Array technology methods are well
known and have general applicability and can be used to address a variety
of questions in molecular genetics including gene expression, genetic
linkage, and genetic variability; see, for example, M. Chee et al., Science,
274, 610-613 (1996) and other references cited therein.

Detection of abnormally decreased or increased levels of polypeptide or
mRNA expression may also be used for diagnosing or determining
susceptibility of a subject to a disease of the invention. Decreased or
increased expression can be measured at the RNA level using any of the
methods well known in the art for the quantitation of polynucleotides,
such as, for example, nucleic acid amplification, for instance PCR, RT-
PCR, RNase protection, Northern blotting and other hybridization
methods. Assay techniques that can be used to determine levels of a
protein, such as a polypeptide of the present invention, in a sample derived
from a host are well-known to those of skill in the art. Such assay methods
include radioimmunoassays, competitive-binding assays, Western Blot
analysis and ELISA assays.

Thus in another aspect, the present invention relates to a diagnostic kit
comprising:

(a) a polynucleotide of the present invention, preferably the nucleotide
sequence of SEQ ID NO: 1, or a fragment or an RNA transcript thereof;

(b) a nucleotide sequence complementary to that of (a);

(c) a polypeptide of the present invention, preferably the polypeptide of
SEQ ID NO:2 or a fragment thereof; or

(d) an antibody to a polypeptide of the present invention, preferably to the
polypeptide of SEQ ID NO:2.

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise
a substantial component. Such a kit will be of use in diagnosing a
disease or susceptibility to a disease, particularly diseases of the
invention, amongst others.



The polynucleotide sequences of the present invention are valuable for
chromosome localisation studies. The sequence is specifically targeted to,
and can hybridize with, a particular location on an individual human
chromosome. The mapping of relevant sequences to chromosomes
according to the present invention is an important first step in correlating
those sequences with gene-associated disease. Once a sequence has
been mapped to a precise chromosomal location, the physical position of
the sequence on the chromosome can be correlated with genetic map data.
Such data are found in, for example, V. McKusick, Mendelian Inheritance in
Man (available on-line through Johns Hopkins University Welch Medical
Library). The relationship between genes and diseases that have been
mapped to the same chromosomal region is then identified through
linkage analysis (co-inheritance of physically adjacent genes). Precise
human chromosomal localisations for a genomic sequence (gene
fragment etc.) can be determined using Radiation Hybrid (RH) Mapping
(Walter, M., Spillett, D., Thomas, P., Weissenbach, J., and Goodfellow, P.,
(1994) A method for constructing radiation hybrid maps of whole
genomes, Nature Genetics 7, 22-28). A number of RH panels are
available from Research Genetics (Huntsville, AL, USA), e.g. the
GeneBridge4 RH panel (Hum Mol Genet 1996 Mar;5(3):339-46, A
radiation hybrid map of the human genome, Gyapay G, Schmitt K,
Fizames C, Jones H, Vega-Czarny N, Spillett D, Muselet D, Prud'Homme
JF, Dib C, Auffray C, Morissette J, Weissenbach J, Goodfellow PN). To
determine the chromosomal location of a gene using this panel, 93 PCRs
are performed using primers designed from the gene of interest on RH
DNAs. Each of these DNAs contains random human genomic fragments
maintained in a hamster background (human/hamster hybrid cell lines).
These PCRs result in 93 scores indicating the presence or absence of
the PCR product of the gene of interest. These scores are compared
with scores created using PCR products from genomic sequences of
known location. This comparison is conducted at
http://www.genome.wi.mit.edu/.
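The score comparison described above can be illustrated with a minimal sketch. It assumes each RH result is a string with one character per hybrid DNA ('1' for PCR product present, '0' for absent) and ranks markers of known location by the fraction of hybrids on which they agree with the gene of interest. The function names, the marker names and the plain concordance measure are assumptions made for illustration only; RH mapping services generally use statistical linkage methods rather than this simple count.

```python
def concordance(scores_a: str, scores_b: str) -> float:
    """Fraction of hybrids on which two presence/absence vectors agree."""
    if len(scores_a) != len(scores_b):
        raise ValueError("score vectors must cover the same RH panel")
    if not scores_a:
        raise ValueError("score vectors must not be empty")
    agreements = sum(x == y for x, y in zip(scores_a, scores_b))
    return agreements / len(scores_a)


def rank_markers(gene_scores: str, marker_scores: dict) -> list:
    """Return (marker, concordance) pairs, best-agreeing marker first."""
    ranked = [(name, concordance(gene_scores, scores))
              for name, scores in marker_scores.items()]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Toy panel of 4 hybrids; the GeneBridge4 panel would use 93 scores.
    gene = "1100"
    markers = {"marker_A": "1100", "marker_B": "0011", "marker_C": "1101"}
    for name, value in rank_markers(gene, markers):
        print(name, value)
```

A marker whose score vector closely matches that of the gene of interest is a candidate neighbour on the chromosome, which is the intuition behind the comparison step.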

The polynucleotide sequences of the present invention are also valuable
tools for tissue expression studies. Such studies allow the determination of
expression patterns of polynucleotides of the present invention which may
give an indication as to the expression patterns of the encoded
polypeptides in tissues, by detecting the mRNAs that encode them. The
techniques used are well known in the art and include in situ hybridisation
techniques to clones arrayed on a grid, such as cDNA microarray
hybridisation (Schena et al, Science, 270, 467-470, 1995 and Shalon et al,
Genome Res, 6, 639-645, 1996) and nucleotide amplification techniques
such as PCR. A preferred method uses the TAQMAN (Trade mark)
technology available from Perkin Elmer. Results from these studies can
provide an indication of the normal function of the polypeptide in the
organism. In addition, comparative studies of the normal expression
pattern of mRNAs with that of mRNAs encoded by an alternative form of
the same gene (for example, one having an alteration in polypeptide coding
potential or a regulatory mutation) can provide valuable insights into the role
of the polypeptides of the present invention, or that of inappropriate
expression thereof in disease. Such inappropriate expression may be of a
temporal, spatial or simply quantitative nature.

The polypeptides of the present invention are expressed in blood cells,
cancer tissues, fetal liver, lymph nodes, placenta, spleen, trophoblast cells.

A further aspect of the present invention relates to antibodies. The
polypeptides of the invention or their fragments, or cells expressing them,
can be used as immunogens to produce antibodies that are immunospecific
for polypeptides of the present invention. The term "immunospecific"
means that the antibodies have substantially greater affinity for the
polypeptides of the invention than their affinity for other related polypeptides
in the prior art.

Antibodies generated against polypeptides of the present invention may be
obtained by administering the polypeptides or epitope-bearing fragments, or
cells to an animal, preferably a non-human animal, using routine protocols.
For preparation of monoclonal antibodies, any technique which provides
antibodies produced by continuous cell line cultures can be used.
Examples include the hybridoma technique (Kohler, G. and Milstein, C.,
Nature (1975) 256:495-497), the trioma technique, the human B-cell
hybridoma technique (Kozbor et al., Immunology Today (1983) 4:72) and
the EBV-hybridoma technique (Cole et al., Monoclonal Antibodies and
Cancer Therapy, 77-96, Alan R. Liss, Inc., 1985).

Techniques for the production of single chain antibodies, such as those
described in U.S. Patent No. 4,946,778, can also be adapted to produce
single chain antibodies to polypeptides of this invention. Also, transgenic
mice, or other organisms, including other mammals, may be used to
express humanized antibodies.

The above-described antibodies may be employed to isolate or to identify
clones expressing the polypeptide or to purify the polypeptides by affinity
chromatography. Antibodies against polypeptides of the present invention
may also be employed to treat diseases of the invention, amongst others.



Polypeptides and polynucleotides of the present invention may also be
used as vaccines. Accordingly, in a further aspect, the present invention
relates to a method for inducing an immunological response in a mammal
that comprises inoculating the mammal with a polypeptide of the present
invention, adequate to produce antibody and/or T cell immune response,
including, for example, cytokine-producing T cells or cytotoxic T cells, to
protect said animal from disease, whether that disease is already
established within the individual or not. An immunological response in a
mammal may also be induced by a method that comprises delivering a
polypeptide of the present invention via a vector directing expression of
the polynucleotide and coding for the polypeptide in vivo in order to
induce such an immunological response to produce antibody to protect
said animal from diseases of the invention. One way of administering the
vector is by accelerating it into the desired cells as a coating on particles
or otherwise. Such nucleic acid vector may comprise DNA, RNA, a
modified nucleic acid, or a DNA/RNA hybrid. For use as a vaccine, a
polypeptide or a nucleic acid vector will be normally provided as a
vaccine formulation (composition). The formulation may further comprise
a suitable carrier. Since a polypeptide may be broken down in the
stomach, it is preferably administered parenterally (for instance,
subcutaneous, intramuscular, intravenous, or intradermal injection).
Formulations suitable for parenteral administration include aqueous and
non-aqueous sterile injection solutions that may contain anti-oxidants,
buffers, bacteriostats and solutes that render the formulation isotonic with
the blood of the recipient; and aqueous and non-aqueous sterile
suspensions that may include suspending agents or thickening agents.
The formulations may be presented in unit-dose or multi-dose containers,
for example, sealed ampoules and vials and may be stored in a
freeze-dried condition requiring only the addition of the sterile liquid carrier
immediately prior to use. The vaccine formulation may also include
adjuvant systems for enhancing the immunogenicity of the formulation,
such as oil-in-water systems and other systems known in the art. The
dosage will depend on the specific activity of the vaccine and can be
readily determined by routine experimentation.

Polypeptides of the present invention have one or more biological functions
that are of relevance in one or more disease states, in particular the
diseases of the invention hereinbefore mentioned. It is therefore useful
to identify compounds that stimulate or inhibit the function or level of the
polypeptide. Accordingly, in a further aspect, the present invention
provides for a method of screening compounds to identify those that
stimulate or inhibit the function or level of the polypeptide. Such methods
identify agonists or antagonists that may be employed for therapeutic and
prophylactic purposes for such diseases of the invention as hereinbefore
mentioned. Compounds may be identified from a variety of sources, for
example, cells, cell-free preparations, chemical libraries, collections of
chemical compounds, and natural product mixtures. Such agonists or
antagonists so-identified may be natural or modified substrates, ligands,
receptors, enzymes, etc., as the case may be, of the polypeptide; a
structural or functional mimetic thereof (see Coligan et al., Current
Protocols in Immunology 1(2):Chapter 5 (1991)) or a small molecule.



The screening method may simply measure the binding of a candidate
compound to the polypeptide, or to cells or membranes bearing the
polypeptide, or a fusion protein thereof, by means of a label directly or
indirectly associated with the candidate compound. Alternatively, the
screening method may involve measuring or detecting (qualitatively or
quantitatively) the competitive binding of a candidate compound to the
polypeptide against a labeled competitor (e.g. agonist or antagonist).
Further, these screening methods may test whether the candidate
compound results in a signal generated by activation or inhibition of the
polypeptide, using detection systems appropriate to the cells bearing the
polypeptide. Inhibitors of activation are generally assayed in the
presence of a known agonist and the effect on activation by the agonist
by the presence of the candidate compound is observed. Further, the
screening methods may simply comprise the steps of mixing a candidate
compound with a solution containing a polypeptide of the present
invention, to form a mixture, measuring a heparanase-2 activity in the
mixture, and comparing the heparanase-2 activity of the mixture to a
control mixture which contains no candidate compound.
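The final comparison step, activity of the mixture against a control without candidate compound, might be sketched as follows. This is an illustrative calculation only: the function names, the arbitrary activity units and the 20% decision threshold are assumptions and are not taken from the specification.

```python
def relative_activity(test_activity: float, control_activity: float) -> float:
    """Activity of the compound-containing mixture as a fraction of control."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return test_activity / control_activity


def classify_compound(test_activity: float, control_activity: float,
                      threshold: float = 0.2) -> str:
    """Label a candidate by how far it shifts activity from the control.

    threshold is a hypothetical cut-off: a change of at least 20%
    relative to the control is treated as an effect.
    """
    ratio = relative_activity(test_activity, control_activity)
    if ratio <= 1 - threshold:
        return "inhibitor"
    if ratio >= 1 + threshold:
        return "stimulator"
    return "no effect"


if __name__ == "__main__":
    print(classify_compound(40.0, 100.0))
    print(classify_compound(150.0, 100.0))
    print(classify_compound(105.0, 100.0))
```

In practice the cut-off would be set from the assay's own variability, for example from replicate control wells.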

Polypeptides of the present invention may be employed in conventional
low capacity screening methods and also in high-throughput screening
(HTS) formats. Such HTS formats include not only the well-established
use of 96- and, more recently, 384-well microtiter plates but also emerging
methods such as the nanowell method described by Schullek et al, Anal
Biochem., 246, 20-29, (1997).

Fusion proteins, such as those made from Fc portion and heparanase-2
polypeptide, as hereinbefore described, can also be used for
high-throughput screening assays to identify antagonists for the
polypeptide of the present invention (see D. Bennett et al., J Mol ...
270(16):9459-9471 (1995)).

Screening techniques

The polynucleotides, polypeptides and antibodies to the polypeptide of the
present invention may also be used to configure screening methods for
detecting the effect of added compounds on the production of mRNA and
polypeptide in cells. For example, an ELISA assay may be constructed
for measuring secreted or cell-associated levels of polypeptide using
monoclonal and polyclonal antibodies by standard methods known in the
art. This can be used to discover agents that may inhibit or enhance the
production of polypeptide (also called antagonist or agonist, respectively)
from suitably manipulated cells or tissues.

A polypeptide of the present invention may be used to identify membrane
bound or soluble receptors, if any, through standard receptor binding
techniques known in the art. These include, but are not limited to, ligand
binding and crosslinking assays in which the polypeptide is labeled with a
radioactive isotope (for instance, 125I), chemically modified (for instance,
biotinylated), or fused to a peptide sequence suitable for detection or
purification, and incubated with a source of the putative receptor (cells,
cell membranes, cell supernatants, tissue extracts, bodily fluids). Other
methods include biophysical techniques such as surface plasmon
resonance and spectroscopy. These screening methods may also be
used to identify agonists and antagonists of the polypeptide that compete
with the binding of the polypeptide to its receptors, if any. Standard
methods for conducting such assays are well understood in the art.

Examples of antagonists of polypeptides of the present invention include
antibodies or, in some cases, oligonucleotides or proteins that are closely
related to the ligands, substrates, receptors, enzymes, etc., as the case
may be, of the polypeptide, e.g., a fragment of the ligands, substrates,
receptors, enzymes, etc.; or a small molecule that binds to the polypeptide
of the present invention but does not elicit a response, so that the activity
of the polypeptide is prevented.

Screening methods may also involve the use of transgenic technology
and heparanase-2 gene. The art of constructing transgenic animals is
well established. For example, the heparanase-2 gene may be
introduced through microinjection into the male pronucleus of fertilized
oocytes, retroviral transfer into pre- or post-implantation embryos, or
injection of genetically modified, such as by electroporation, embryonic
stem cells into host blastocysts. Particularly useful transgenic animals
are so-called "knock-in" animals in which an animal gene is replaced by
the human equivalent within the genome of that animal. Knock-in
transgenic animals are useful in the drug discovery process, for target
validation, where the compound is specific for the human target. Other
useful transgenic animals are so-called "knock-out" animals in which the
expression of the animal ortholog of a polypeptide of the present
invention and encoded by an endogenous DNA sequence in a cell is
partially or completely annulled. The gene knock-out may be targeted to
specific cells or tissues, may occur only in certain cells or tissues as a
consequence of the limitations of the technology, or may occur in all, or
substantially all, cells in the animal. Transgenic animal technology also
offers a whole animal expression-cloning system in which introduced
genes are expressed to give large amounts of polypeptides of the present
invention.

Screening kits for use in the above described methods form a further
aspect of the present invention. Such screening kits comprise:

(a) a polypeptide of the present invention;

(b) a recombinant cell expressing a polypeptide of the present invention;

(c) a cell membrane expressing a polypeptide of the present invention; or

(d) an antibody to a polypeptide of the present invention;
which polypeptide is preferably that of SEQ ID NO:2.

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise
a substantial component.



Glossary 



The following definitions are provided to facilitate understanding of certain
terms used frequently hereinbefore.

"Antibodies" as used herein includes polyclonal and monoclonal
antibodies, chimeric, single chain, and humanized antibodies, as well as
Fab fragments, including the products of an Fab or other immunoglobulin
expression library.

"Isolated" means altered "by the hand of man" from its natural state, i.e.,
if it occurs in nature, it has been changed or removed from its original
environment, or both. For example, a polynucleotide or a polypeptide
naturally present in a living organism is not "isolated," but the same
polynucleotide or polypeptide separated from the coexisting materials of
its natural state is "isolated", as the term is employed herein. Moreover,
a polynucleotide or polypeptide that is introduced into an organism by
transformation, genetic manipulation or by any other recombinant method
is "isolated" even if it is still present in said organism, which organism
may be living or non-living.

"Polynucleotide" generally refers to any polyribonucleotide (RNA) or
polydeoxyribonucleotide (DNA), which may be unmodified or modified
RNA or DNA. "Polynucleotides" include, without limitation, single- and
double-stranded DNA, DNA that is a mixture of single- and double-
stranded regions, single- and double-stranded RNA, and RNA that is a
mixture of single- and double-stranded regions, hybrid molecules
comprising DNA and RNA that may be single-stranded or, more typically,
double-stranded or a mixture of single- and double-stranded regions. In
addition, "polynucleotide" refers to triple-stranded regions comprising
RNA or DNA or both RNA and DNA. The term "polynucleotide" also
includes DNAs or RNAs containing one or more modified bases and
DNAs or RNAs with backbones modified for stability or for other reasons.
"Modified" bases include, for example, tritylated bases and unusual bases
such as inosine. A variety of modifications may be made to DNA and
RNA; thus, "polynucleotide" embraces chemically, enzymatically or
metabolically modified forms of polynucleotides as typically found in
nature, as well as the chemical forms of DNA and RNA characteristic of
viruses and cells. "Polynucleotide" also embraces relatively short
polynucleotides, often referred to as oligonucleotides.

"Polypeptide" refers to any polypeptide comprising two or more amino 
acids joined to each other by peptide bonds or modified peptide bonds,
i.e., peptide isosteres. "Polypeptide" refers to both short chains,
commonly referred to as peptides, oligopeptides or oligomers, and to
longer chains, generally referred to as proteins. Polypeptides may
contain amino acids other than the 20 gene-encoded amino acids.
"Polypeptides" include amino acid sequences modified either by natural
processes, such as post-translational processing, or by chemical
modification techniques that are well known in the art. Such
modifications are well described in basic texts and in more detailed
monographs, as well as in a voluminous research literature.

Modifications may occur anywhere in a polypeptide, including the peptide
backbone, the amino acid side-chains and the amino or carboxyl termini.
It will be appreciated that the same type of modification may be present
to the same or varying degrees at several sites in a given polypeptide.
Also, a given polypeptide may contain many types of modifications.
Polypeptides may be branched as a result of ubiquitination, and they may
be cyclic, with or without branching. Cyclic, branched and branched
cyclic polypeptides may result from post-translation natural processes or
may be made by synthetic methods. Modifications include acetylation,
acylation, ADP-ribosylation, amidation, biotinylation, covalent attachment
of flavin, covalent attachment of a heme moiety, covalent attachment of a
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid
derivative, covalent attachment of phosphatidylinositol, cross-linking,
cyclization, disulfide bond formation, demethylation, formation of covalent
cross-links, formation of cystine, formation of pyroglutamate, formylation,
gamma-carboxylation, glycosylation, GPI anchor formation,
hydroxylation, iodination, methylation, myristoylation, oxidation,
proteolytic processing, phosphorylation, prenylation, racemization,
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to
proteins such as arginylation, and ubiquitination (see, for instance,

Proteins - Structure and Molecular Properties, 2nd Ed., T. E. Creighton,
W. H. Freeman and Company, New York, 1993; Wold, F., Post-
translational Protein Modifications: Perspectives and Prospects, 1-12, in
Post-translational Covalent Modification of Proteins, B. C. Johnson, Ed.,
Academic Press, New York, 1983; Seifter et al., "Analysis for protein
modifications and nonprotein cofactors", Meth Enzymol, 182, 626-646,
1990, and Rattan et al., "Protein Synthesis: Post-translational
Modifications and Aging", Ann NY Acad Sci, 663, 48-62, 1992).

"Fragment" of a polypeptide sequence refers to a polypeptide sequence
that is shorter than the reference sequence but that retains essentially the
same biological function or activity as the reference polypeptide.
"Fragment" of a polynucleotide sequence refers to a polynucleotide
sequence that is shorter than the reference sequence of SEQ ID NO:1.

"Variant" refers to a polynucleotide or polypeptide that differs from a
reference polynucleotide or polypeptide, but retains the essential
properties thereof. A typical variant of a polynucleotide differs in
nucleotide sequence from the reference polynucleotide. Changes in the
nucleotide sequence of the variant may or may not alter the amino acid
sequence of a polypeptide encoded by the reference polynucleotide.
Nucleotide changes may result in amino acid substitutions, additions,
deletions, fusions and truncations in the polypeptide encoded by the
reference sequence, as discussed below. A typical variant of a
polypeptide differs in amino acid sequence from the reference
polypeptide. Generally, alterations are limited so that the sequences of
the reference polypeptide and the variant are closely similar overall and,
in many regions, identical. A variant and reference polypeptide may differ
in amino acid sequence by one or more substitutions, insertions,
deletions in any combination. A substituted or inserted amino acid
residue may or may not be one encoded by the genetic code. Typical
conservative substitutions include Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn, Gln;
Ser, Thr; Lys, Arg; and Phe and Tyr. A variant of a polynucleotide or
polypeptide may be naturally occurring such as an allele, or it may be a
variant that is not known to occur naturally. Non-naturally occurring
variants of polynucleotides and polypeptides may be made by
mutagenesis techniques or by direct synthesis. Also included as variants
are polypeptides having one or more post-translational modifications, for
instance glycosylation, phosphorylation, methylation, ADP ribosylation
and the like. Embodiments include methylation of the N-terminal amino
acid, phosphorylations of serines and threonines and modification of C-
terminal glycines.

"Allele" refers to one of two or more alternative forms of a gene occurring
at a given locus in the genome.

"Polymorphism" refers to a variation in nucleotide sequence (and
encoded polypeptide sequence, if relevant) at a given position in the
genome within a population.

"Single Nucleotide Polymorphism" (SNP) refers to the occurrence of
nucleotide variability at a single nucleotide position in the genome, within
a population. An SNP may occur within a gene or within intergenic
regions of the genome. SNPs can be assayed using Allele Specific
Amplification (ASA). For the process at least 3 primers are required. A
common primer is used in reverse complement to the polymorphism
being assayed. This common primer can be between 50 and 1500 bps
from the polymorphic base. The other two (or more) primers are identical
to each other except that the final 3' base wobbles to match one of the
two (or more) alleles that make up the polymorphism. Two (or more)
PCR reactions are then conducted on sample DNA, each using the
common primer and one of the Allele Specific Primers.
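The primer layout for Allele Specific Amplification can be sketched in a few lines. The sketch assumes the template is given as the forward strand, that the polymorphic position is a 0-based index into it, and that a fixed primer length is acceptable; the function names and the default length of 20 are hypothetical, and real primer design would also weigh melting temperature and specificity.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def allele_specific_primers(template: str, snp_index: int,
                            alleles: list, length: int = 20) -> dict:
    """Build one primer per allele; they differ only in the final 3' base."""
    start = snp_index - (length - 1)
    if start < 0:
        raise ValueError("not enough sequence upstream of the polymorphism")
    stem = template[start:snp_index]  # shared bases 5' of the polymorphic base
    return {allele: stem + allele for allele in alleles}


def common_primer(template: str, snp_index: int, distance: int,
                  length: int = 20) -> str:
    """Reverse-complement primer placed `distance` bases past the SNP."""
    if not 50 <= distance <= 1500:
        raise ValueError("distance should be between 50 and 1500 bases")
    start = snp_index + distance
    end = start + length
    if end > len(template):
        raise ValueError("template too short for the requested primer")
    return reverse_complement(template[start:end])


if __name__ == "__main__":
    template = "ACGT" * 30
    print(allele_specific_primers(template, 20, ["A", "G"], length=5))
    print(common_primer(template, 20, 50, length=8))
```

Each allele-specific primer is then paired with the common primer in its own PCR, and the reaction that yields product indicates which allele is present in the sample.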

"Splice Variant" as used herein refers to cDNA molecules produced from
RNA molecules initially transcribed from the same genomic DNA
sequence but which have undergone alternative RNA splicing.
Alternative RNA splicing occurs when a primary RNA transcript
undergoes splicing, generally for the removal of introns, which results in
the production of more than one mRNA molecule each of which may
encode different amino acid sequences. The term splice variant also
refers to the proteins encoded by the above cDNA molecules.

"Identity" reflects a relationship between two or more polypeptide
sequences or two or more polynucleotide sequences, determined by
comparing the sequences. In general, identity refers to an exact
nucleotide to nucleotide or amino acid to amino acid correspondence of
the two polynucleotide or two polypeptide sequences, respectively, over
the length of the sequences being compared.

"% Identity" - For sequences where there is not an exact
correspondence, a "% identity" may be determined. In general, the two
sequences to be compared are aligned to give a maximum correlation
between the sequences. This may include inserting "gaps" in either one
or both sequences, to enhance the degree of alignment. A % identity
may be determined over the whole length of each of the sequences being
compared (so-called global alignment), that is particularly suitable for
sequences of the same or very similar length, or over shorter, defined
lengths (so-called local alignment), that is more suitable for sequences of
unequal length.
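Once two sequences have been aligned, a % identity can be computed by counting matching columns. The sketch below assumes the alignment is already given as two equal-length strings with '-' marking gaps, and it divides by the number of aligned columns (gap columns included). That choice of denominator is an assumption; alignment programs differ on this point, so values from this sketch need not match those reported by any particular package.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """% identity over the columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap; they hold no residues.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # One mismatch and one gap in six columns: four identical positions.
    print(percent_identity("ACGT-A", "ACCTTA"))
```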

"Similarity" is a further, more sophisticated measure of the relationship
between two polypeptide sequences. In general, "similarity" means a
comparison between the amino acids of two polypeptide chains, on a
residue by residue basis, taking into account not only exact
correspondences between pairs of residues, one from each of
the sequences being compared (as for identity) but also, where there is
not an exact correspondence, whether, on an evolutionary basis, one
residue is a likely substitute for the other. This likelihood has an
associated "score" from which the "% similarity" of the two sequences
can then be determined.

Methods for comparing the identity and similarity of two or more
sequences are well known in the art. Thus for instance, programs
available in the Wisconsin Sequence Analysis Package, version 9.1
(Devereux J et al, Nucleic Acids Res, 12, 387-395, 1984, available from
Genetics Computer Group, Madison, Wisconsin, USA), for example the
programs BESTFIT and GAP, may be used to determine the % identity
between two polynucleotides and the % identity and the % similarity
between two polypeptide sequences. BESTFIT uses the "local
homology" algorithm of Smith and Waterman (J Mol Biol, 147, 195-197,
1981, Advances in Applied Mathematics, 2, 1981) and finds the
best single region of similarity between two sequences. BESTFIT is
more suited to comparing two polynucleotide or two polypeptide
sequences that are dissimilar in length, the program assuming that the
shorter sequence represents a portion of the longer. In comparison, GAP
aligns two sequences, finding a "maximum similarity", according to the
algorithm of Needleman and Wunsch (J Mol Biol, 48, 443-453, 1970).
GAP is more suited to comparing sequences that are approximately the
same length and an alignment is expected over the entire length.
Preferably, the parameters "Gap Weight" and "Length Weight" used in
each program are 50 and 3, for polynucleotide sequences and 12 and 4
for polypeptide sequences, respectively. Preferably, % identities and
similarities are determined when the two sequences being compared are
optimally aligned.

Other programs for determining identity and/or similarity between
sequences are also known in the art, for instance the BLAST family of
programs (Altschul S F et al, J Mol Biol, 215, 403-410, 1990, Altschul S F
et al, Nucleic Acids Res., 25:3389-3402, 1997, available from the National
Center for Biotechnology Information (NCBI), Bethesda, Maryland, USA
and accessible through the home page of the NCBI at
www.ncbi.nlm.nih.gov) and FASTA (Pearson W R, Methods in
Enzymology, 183, 63-89, 1990; Pearson W R and Lipman D J, Proc Nat
Acad Sci USA, 85, 2444-2448, 1988, available as part of the Wisconsin
Sequence Analysis Package).
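To make the idea of a global alignment concrete, here is a minimal sketch of Needleman-Wunsch scoring by dynamic programming. It is a deliberate simplification and not the GAP program: it uses a single linear gap penalty and a plain match/mismatch score instead of separate gap and length weights and a substitution matrix, and the function name and default scores are assumptions chosen for illustration.

```python
def needleman_wunsch_score(a: str, b: str, match: int = 1,
                           mismatch: int = -1, gap: int = -2) -> int:
    """Best global alignment score of a and b under a linear gap penalty."""
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix of b
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap inserted in b
            left = curr[j - 1] + gap  # gap inserted in a
            curr.append(max(diagonal, up, left))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    print(needleman_wunsch_score("ACGT", "ACGT"))
    print(needleman_wunsch_score("ACGT", "AGT"))
```

Because every residue of both sequences must be placed in the alignment, this kind of scoring suits sequences of similar length, which matches the stated use of GAP; a local method such as Smith-Waterman instead lets the alignment start and stop anywhere.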

Preferably, the BLOSUM62 amino acid substitution matrix (Henikoff S
and Henikoff J G, Proc. Nat. Acad Sci. USA, 89, 10915-10919, 1992) is
used in polypeptide sequence comparisons including where nucleotide
sequences are first translated into amino acid sequences before
comparison.

Preferably, the program BESTFIT is used to determine the % identity of a
query polynucleotide or a polypeptide sequence with respect to a
reference polynucleotide or a polypeptide sequence, the query and the
reference sequence being optimally aligned and the parameters of the
program set at the default value, as hereinbefore described.

"Identity Index" is a measure of sequence relatedness which may be 
used to compare a candidate sequence (polynucleotide or polypeptide) 
and a reference sequence. Thus, for instance, a candidate 
polynucleotide sequence having, for example, an Identity Index of 0.95 
compared to a reference polynucleotide sequence is identical to the
reference sequence except that the candidate polynucleotide sequence
may include on average up to five differences per each 100 nucleotides
of the reference sequence. Such differences are selected from the group
consisting of at least one nucleotide deletion, substitution, including
transition and transversion, or insertion. These differences may occur at
the 5' or 3' terminal positions of the reference polynucleotide sequence or
anywhere between these terminal positions, interspersed either
individually among the nucleotides in the reference sequence or in one or
more contiguous groups within the reference sequence. In other words,
to obtain a polynucleotide sequence having an Identity Index of 0.95
compared to a reference polynucleotide sequence, an average of up to 5
in every 100 of the nucleotides in the reference sequence may be
deleted, substituted or inserted, or any combination thereof, as
hereinbefore described. The same applies mutatis mutandis for other
values of the Identity Index, for instance 0.96, 0.97, 0.98 and 0.99.

Similarly, for a polypeptide, a candidate polypeptide sequence having, for
example, an Identity Index of 0.95 compared to a reference polypeptide
sequence is identical to the reference sequence except that the
polypeptide sequence may include an average of up to five differences
per each 100 amino acids of the reference sequence. Such differences
are selected from the group consisting of at least one amino acid
deletion, substitution, including conservative and non-conservative
substitution, or insertion. These differences may occur at the amino- or
carboxy-terminal positions of the reference polypeptide sequence or
anywhere between these terminal positions, interspersed either
individually among the amino acids in the reference sequence or in one
or more contiguous groups within the reference sequence. In other
words, to obtain a polypeptide sequence having an Identity Index of 0.95
compared to a reference polypeptide sequence, an average of up to 5 in
every 100 of the amino acids in the reference sequence may be deleted,
substituted or inserted, or any combination thereof, as hereinbefore
described. The same applies mutatis mutandis for other values of the
Identity Index, for instance 0.96, 0.97, 0.98 and 0.99.

The relationship between the number of nucleotide or amino acid
differences and the Identity Index may be expressed in the following
equation:

\[ n_a \le x_a - (x_a \cdot I), \]

in which:

$n_a$ is the number of nucleotide or amino acid differences,

$x_a$ is the total number of nucleotides or amino acids in SEQ ID NO:1 or
SEQ ID NO:2, respectively,

$I$ is the Identity Index,

$\cdot$ is the symbol for the multiplication operator, and

in which any non-integer product of $x_a$ and $I$ is rounded down to the
nearest integer prior to subtracting it from $x_a$.
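The bound on the number of differences can be worked through numerically. The sketch below computes the largest permitted number of differences for a reference of a given length, rounding the product of length and Identity Index down before subtracting, as stated above. The function names are hypothetical, and exact fractions are used so that a value such as 0.95 is not distorted by binary floating-point rounding.

```python
import math
from fractions import Fraction


def max_differences(total_length: int, identity_index) -> int:
    """Largest number of differences allowed: x - floor(x * I)."""
    if total_length < 0:
        raise ValueError("total_length must be non-negative")
    # str() keeps the decimal value as written, e.g. 0.95 becomes 19/20.
    index = Fraction(str(identity_index))
    if not 0 <= index <= 1:
        raise ValueError("identity_index must lie between 0 and 1")
    return total_length - math.floor(total_length * index)


def satisfies_identity_index(differences: int, total_length: int,
                             identity_index) -> bool:
    """True when the observed differences stay within the permitted bound."""
    return differences <= max_differences(total_length, identity_index)


if __name__ == "__main__":
    # A 592-residue reference at Identity Index 0.95: 592 * 0.95 = 562.4,
    # rounded down to 562, so up to 30 differences are permitted.
    print(max_differences(592, 0.95))
```

The reference length of 592 in the usage example is arbitrary and is not the length of SEQ ID NO:1 or SEQ ID NO:2.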

"Homolog" is a generic term used in the art to indicate a polynucleotide or 
polypeptide sequence possessing a high degree of sequence relatedness
to a reference sequence. Such relatedness may be quantified by
determining the degree of identity and/or similarity between the two
sequences as hereinbefore defined. Falling within this generic term are
the terms "ortholog" and "paralog". "Ortholog" refers to a polynucleotide
or polypeptide that is the functional equivalent of the polynucleotide or
polypeptide in another species. "Paralog" refers to a polynucleotide or
polypeptide within the same species that is functionally similar.

"Fusion protein" refers to a protein encoded by two, unrelated, fused 
genes or fragments thereof. Examples have been disclosed in US 
5541087, 5726044. In the case of Fc-heparanase-2, employing an
immunoglobulin Fc region as a part of a fusion protein is advantageous
for performing the functional expression of Fc-heparanase-2 or fragments
of heparanase-2, to improve pharmacokinetic properties of such a fusion
protein when used for therapy and to generate a dimeric heparanase-2.
The Fc-heparanase-2 DNA construct comprises, in 5' to 3' direction, a
secretion cassette, i.e. a signal sequence that triggers export from a
mammalian cell, DNA encoding an immunoglobulin Fc region fragment
as a fusion partner, and a DNA encoding heparanase-2 or fragments
thereof. In some uses it would be desirable to be able to alter the intrinsic
functional properties (complement binding, Fc-receptor binding) by
mutating the functional Fc sites while leaving the rest of the fusion
protein untouched, or to delete the Fc part completely after expression.

All publications and references, including but not limited to patents and 
patent applications, cited in this specification are herein incorporated by 
reference in their entirety as if each individual publication or reference 
were specifically and individually indicated to be incorporated by
reference herein as being fully set forth. Any patent application to which
this application claims priority is also incorporated by reference herein in
its entirety in the manner described above for publications and 
references. 



Claims 

1. An isolated polypeptide selected from one of the groups consisting of:

(a) an isolated polypeptide encoded by a polynucleotide comprising the sequence
of SEQ ID NO:1;

(b) an isolated polypeptide comprising a polypeptide sequence having at least
95% identity to the polypeptide sequence of SEQ ID NO:2;

(c) an isolated polypeptide having at least 95% identity to the polypeptide
sequence of SEQ ID NO:2;

(d) the polypeptide sequence of SEQ ID NO:2; and

(e) fragments and variants of such polypeptides in (a) to (d).

2. The isolated polypeptide as claimed in claim 1 comprising the polypeptide
sequence of SEQ ID NO:2.

3. The isolated polypeptide as claimed in claim 1 which is the polypeptide
sequence of SEQ ID NO:2.

4. An isolated polynucleotide selected from one of the groups consisting of:

(a) an isolated polynucleotide comprising a polynucleotide sequence having at
least 95% identity to the polynucleotide sequence of SEQ ID NO:1;

(b) an isolated polynucleotide having at least 95% identity to the polynucleotide of
SEQ ID NO:1;

(c) an isolated polynucleotide comprising a polynucleotide sequence encoding a
polypeptide sequence having at least 96% identity to the polypeptide sequence of
SEQ ID NO:2;

(d) an isolated polynucleotide having a polynucleotide sequence encoding a
polypeptide sequence having at least 96% identity to the polypeptide sequence of
SEQ ID NO:2;

(e) an isolated polynucleotide with a nucleotide sequence of at least 100
nucleotides obtained by screening a library under stringent hybridization conditions
with a labeled probe having the sequence of SEQ ID NO:1 or a fragment thereof
having at least 15 nucleotides;

(f) a polynucleotide which is the RNA equivalent of a polynucleotide of (a) to (e);

or a polynucleotide sequence complementary to said isolated polynucleotide

and polynucleotides that are variants and fragments of the above mentioned
polynucleotides or that are complementary to above mentioned polynucleotides,
over the entire length thereof.



5. An isolated polynucleotide as claimed in claim 4 selected from the group
consisting of:

(a) an isolated polynucleotide comprising the polynucleotide of SEQ ID NO:1;

(b) the isolated polynucleotide of SEQ ID NO:1;

(c) an isolated polynucleotide comprising a polynucleotide sequence encoding the
polypeptide of SEQ ID NO:2; and

(d) an isolated polynucleotide encoding the polypeptide of SEQ ID NO:2.



6. An expression system comprising a polynucleotide capable of producing a
polypeptide of claim 1 when said expression vector is present in a compatible host
cell.



7. A recombinant host cell comprising the expression vector of claim 6 or a 
membrane thereof expressing the polypeptide of claim 1. 



8. A process for producing a polypeptide of claim 1 comprising the step of
culturing a host cell as defined in claim 7 under conditions sufficient for the
production of said polypeptide and recovering the polypeptide from the culture
medium.



9. A fusion protein consisting of the immunoglobulin Fc-region and any one
polypeptide of claim 1.

10. An antibody immunospecific for the polypeptide of any one of claims 1 to 3. 

11. A method for screening to identify compounds that stimulate or inhibit the
function or level of the polypeptide of claim 1 comprising a method selected from
the group consisting of:

(a) measuring or detecting, quantitatively or qualitatively, the binding of a
candidate compound to the polypeptide (or to the cells or membranes expressing
the polypeptide) or a fusion protein thereof by means of a label directly or
indirectly associated with the candidate compound;

(b) measuring the competition of binding of a candidate compound to the
polypeptide (or to the cells or membranes expressing the polypeptide) or a fusion
protein thereof in the presence of a labeled competitor;

(c) testing whether the candidate compound results in a signal generated by
activation or inhibition of the polypeptide, using detection systems appropriate to
the cells or cell membranes expressing the polypeptide;

(d) mixing a candidate compound with a solution containing a polypeptide of
claim 1, to form a mixture, measuring activity of the polypeptide in the mixture,
and comparing the activity of the mixture to a control mixture which contains no
candidate compound; or

(e) detecting the effect of a candidate compound on the production of mRNA
encoding said polypeptide or said polypeptide in cells, using for instance an
ELISA assay, and

(f) producing said compound according to biotechnological or chemical standard
techniques.
Abstract

Heparanase-2 polypeptides and polynucleotides and methods for 
producing such polypeptides by recombinant techniques are disclosed. 
Also disclosed are methods for utilizing heparanase-2 polypeptides and 
polynucleotides in diagnostic assays. 
SEQUENCE LISTING

<110> Merck Patent GmbH
<120> Novel heparanase
<130> HSPtiaseKDWS
<140>
<141>

<160> 2

<170> PatentIn Ver. 2.1

<210> 1
<211>
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(1779)



S ?2 I" ^s: I- IT; ir. tii q i- 
" r= IS Si Si S5 IS is 

20 2^ 

„ s| SI IS s'. 21 is r/, 
?s i SI If. i£ SI - sj t:'. 

40 5C 55 

„, „= .c= .== =c. f?'^-^ Ill IS r.: IS 

Val 9er Thr Lys Asp Pro Va* Arg Titr vax 

€5 ^® 
eg cag |at ccj .cc a.c a« cat gat gc tgg ccc |at t.c eta 
:.eu Gin Leu Asp Pro Ser Ila lie «ap va^y * f 95 
85 *0 

» SI 5« til fa! S2 s: is s? ii| is 
„ r.3 I!? % i 22 ig IS 
J?', s! s t". r« IS IS III s? lii ^ - ^» 



96 



144 



192 



240 



336 



384 
aaa aac tac gag gat gac att gtt cga agt gat gtt gcc cta gat aaa 480
Lys Asn Tyr Glu Asp Asp Ile Val Arg Ser Asp Val Ala Leu Asp Lys
145                 150                 155                 160

cag aaa ggc tgc aag att gcc cag cac cct gat gtt atg ctg gtg ctc 528
Gln Lys Gly Cys Lys Ile Ala Gln His Pro Asp Val Met Leu Val Leu
                165                 170                 175

caa agg gag aag gca gct cag atg cat ctg gtt ctt cta aag gag caa 576
Gln Arg Glu Lys Ala Ala Gln Met His Leu Val Leu Leu Lys Glu Gln
            180                 185                 190

ttc tcc aat act tac agt aat ctc ata tta aca gcc agg tct cta gac 624
Phe Ser Asn Thr Tyr Ser Asn Leu Ile Leu Thr Ala Arg Ser Leu Asp
        195                 200                 205

aaa ctt tat aac ttt gct gat tgc tct gga ctc cac ctg ata ttt gct 672
Lys Leu Tyr Asn Phe Ala Asp Cys Ser Gly Leu His Leu Ile Phe Ala
    210                 215                 220

cta aat gca ctg cgt cgt aat ccc aat aac tcc tgg aac agt tct agt 720
Leu Asn Ala Leu Arg Arg Asn Pro Asn Asn Ser Trp Asn Ser Ser Ser
225                 230                 235                 240

gcc ctg agt ctg ttg aag tac agc gcc agc aaa aag tac aac att tct 768
Ala Leu Ser Leu Leu Lys Tyr Ser Ala Ser Lys Lys Tyr Asn Ile Ser
                245                 250                 255

tgg gaa ctg ggt aat gag cca aat aac tat cgg acc atg cat ggc cgg 816
Trp Glu Leu Gly Asn Glu Pro Asn Asn Tyr Arg Thr Met His Gly Arg
            260                 265                 270

gca gta aat ggc agc cag ttg gga aag gat tac atc cag ctg aag agc 864
Ala Val Asn Gly Ser Gln Leu Gly Lys Asp Tyr Ile Gln Leu Lys Ser
        275                 280                 285

ctg ttg cag ccc atc cgg att tat tcc aga gcc agc tta tat ggc cct 912
Leu Leu Gln Pro Ile Arg Ile Tyr Ser Arg Ala Ser Leu Tyr Gly Pro
    290                 295                 300

aat at? ggg cgg ccg ?gg aag aat gtc atc gcc ctc cta gat gga ttc 960
Asn Ile Gly Arg Pro Arg Lys Asn Val Ile Ala Leu Leu Asp Gly Phe
305                 310                 315                 320

atg aag gtg gca gga agt aca gta gat gca gtt acc tgg caa cat tgc 1008
Met Lys Val Ala Gly Ser Thr Val Asp Ala Val Thr Trp Gln His Cys
                325                 330                 335

tac att gat ggc cgg gtg gtc aag gtg atg gac ttc ctg aaa act cgc 1056
Tyr Ile Asp Gly Arg Val Val Lys Val Met Asp Phe Leu Lys Thr Arg
            340                 345                 350

ctg tta gac aca ctc tct gac cag att agg aaa att cag aaa gtg gtt 1104
Leu Leu Asp Thr Leu Ser Asp Gln Ile Arg Lys Ile Gln Lys Val Val
        355                 360                 365

aat aca tac act cca gga aag aag att tgg ctt gaa ggt gtg gtg acc 1152
Asn Thr Tyr Thr Pro Gly Lys Lys Ile Trp Leu Glu Gly Val Val Thr
    370                 375                 380

acc tca gct gga ggc aca aac aat cta tcc gat tcc tat gct gca gga 1200
Thr Ser Ala Gly Gly Thr Asn Asn Leu Ser Asp Ser Tyr Ala Ala Gly
385                 390                 395                 400

ttc tta tgg ttg aac act tta gga atg ctg gcc aat cag ggc att gat 1248
Phe Leu Trp Leu Asn Thr Leu Gly Met Leu Ala Asn Gln Gly Ile Asp
                405                 410                 415

gtc gtg ata cga cac tca ttt ttt gac cat gga tac aat cac ctc gtg 1296
Val Val Ile Arg His Ser Phe Phe Asp His Gly Tyr Asn His Leu Val
            420                 425                 430

gac cag aat ttt aac cca tta cca gac tac tgg ctc tct ctc ctc ta? 1344
Asp Gln Asn Phe Asn Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Tyr
        435                 440                 445

aag cgc ctg atc ggc ccc aaa gtc ttg gct gtg cat gtg gct ggg ctc 1392
Lys Arg Leu Ile Gly Pro Lys Val Leu Ala Val His Val Ala Gly Leu
    450                 455                 460

cag cgg aag cca cgg cct ggc cga gtg atc cgg gac aaa cta agg att 1440
Gln Arg Lys Pro Arg Pro Gly Arg Val Ile Arg Asp Lys Leu Arg Ile
465                 470                 475                 480

tat gct cac tgc aca aac cac cac aac cac aac tac gtt cgt ggg tcc 1488
Tyr Ala His Cys Thr Asn His His Asn His Asn Tyr Val Arg Gly Ser
                485                 490                 495

att aca ctt ttt atc atc aac ttg cat cga tca aga aag aaa atc aag 1536
Ile Thr Leu Phe Ile Ile Asn Leu His Arg Ser Arg Lys Lys Ile Lys
            500                 505                 510

ctg gct ggg act ctc aga gac aag ctg gtt cac cag tac ctg ctg cag 1584
Leu Ala Gly Thr Leu Arg Asp Lys Leu Val His Gln Tyr Leu Leu Gln
        515                 520                 525

ccc tat ggg cag gag ggc cta aag tcc aag tca gtg caa ctg aat ggc 1632
Pro Tyr Gly Gln Glu Gly Leu Lys Ser Lys Ser Val Gln Leu Asn Gly
    530                 535                 540

cag ccc tta gtg atg gtg gac gac ggg acc ctc cca gaa ttg aag ccc 1680
Gln Pro Leu Val Met Val Asp Asp Gly Thr Leu Pro Glu Leu Lys Pro
545                 550                 555                 560

cgc ccc ctt cgg gcc ggc cgg aca ttg gtc atc cct cca gtc acc atg 1728
Arg Pro Leu Arg Ala Gly Arg Thr Leu Val Ile Pro Pro Val Thr Met
                565                 570                 575

ggc ttt tat gtg gtc aag aat gtc aat gct ttg gcc tgc cgc tac cga 1776
Gly Phe Tyr Val Val Lys Asn Val Asn Ala Leu Ala Cys Arg Tyr Arg
            580                 585                 590

taa 1779

<210> 2
<211> 592
<212> PRT
<213> Homo sapiens

<400> 2



Mer Pro Ser 
Leu Tyr Leu 



Ser Asn Ser 
15 

Ala Leu Leu 
30 

Pro Leu Pro 




Gin Arg 

25 Phe Ser 

Lys Lsu 
210 
Lei: Aan 
30 225 

Ala Leu 

Trp Glu 

35 Ala Val 

Leu Leu 
290 
Adn IXe 
40 305 

Met Lys 

Tyr lie 

45 Leu Leu 

Asn Thr 
370 
rhr Ser 
50 385 

Pile Leu 

Val Val 

55 A$p Gin 

Lys Arg 
450 
Gin Arg 
60 4 65 

Tyr Ala 



Ala Leu 

Ser Leu 

Leu Gly 
2SC 
Asn Gly 
275 

Gin Pro 

Gly Arg 

Val Ala 

Asp Gly 
340 
Asp Thr 
355 

Tyr Thr 

Ala Gly 

Irp Leu 

lie Arg 
420 
Asn Phe 
435 

Leu He 
Lye Pro 
His Cys 



Tyr Ser 

Phe Ale 

Arg Arg 
230 
Leu Lys 
245 

Asn Glu 

Ser Gin 

lie Arg 

Pro Arg 
310 
Gly Ser 
325 

Arg Val 

Leu Ser 

Pro Gly 

Gly Thr 
390 
Asn Thr 
405 

His Ser 

Asn Pro 

Gly Pro 

Arg Pro 
470 
Thr Aen 



Asn Leu 
200 
Asp Cys 
215 

Asn Pro 

Tyr Ser 

Pro Asn 

Leu GXy 
280 
lie Tyr 
295 

Lys Aen 

Thr Val 

Val Lys 

Asp Gin 
360 

Lys Lys 
375 

hsn Asn 
Leu Gly 
Phe Phe 



Ser Gly 

Asn Asn 

Ala Ser 
250 
Asn Tyr 
2 65 

Lys Asp 

Ser Arg 

Val He 

Asp Ala 
330 
Val Met 
345 

rie Arg 
lie Trp 
Leu Ser 



Thr Ala Arg 
205 

Leu His Leu 

22C 
Ser Trp Asn 
235 

Lys Lys Tyr 

Arg Thr Mer 

Tyr lie Gin 
285 

Ala Ser Leu 

300 
Ala Leu Leu 
315 

Val Thr Trp 

Aap Phe Leu 

Lys He Gin 
365 

Leu Glu Gly 

330 
Asp Ser Tyr 
395 

Ala Asn Gin 



Leu Pro 
440 

Lys Val 
455 

Gly ?Vrg 



Met Leu 
410 

Asp His Gly Tyr Asn 
425 

Asp Tyr 



Leu Ala 
Val lie 
Kls H±s Asn His 



Trp Leu Ser 
445 

Val His Val 

4€0 
Arg Asp Lys 
475 

Aan Tyr Val 



Leu Asp Lys 
160 

Leu Val Leu 

175 
Lys Glu Gin 
190 

Ser Leu Asp 

He Phe Ala 

Ser Ser Ser 
240 

Asn lie Ser 

255 
His Gly Arg 
270 

Leu Lys Ser 

Tyr Gly Pro 

Asp Gly Phe 
320 

Gin His Cys 

335 
Lys Thr Arg 
350 

Lys Val Val 

Val Val Thr 

Ala Ala Gly 
400 

Gly He Asp 

415 
Hia Leu Val 
430 

Leu Leu Tyr 

Ala Gly Leu 

Leu Arg lie 
460 

Arg Gly Ser 
485 490 495

Ile Thr Leu Phe Ile Ile Asn Leu His Arg Ser Arg Lys Lys Ile Lys
            500                 505                 510

Leu Ala Gly Thr Leu Arg Asp Lys Leu Val His Gln Tyr Leu Leu Gln
        515                 520                 525

Pro Tyr Gly Gln Glu Gly Leu Lys Ser Lys Ser Val Gln Leu Asn Gly
    530                 535                 540

Gln Pro Leu Val Met Val Asp Asp Gly Thr Leu Pro Glu Leu Lys Pro
545                 550                 555                 560

Arg Pro Leu Arg Ala Gly Arg Thr Leu Val Ile Pro Pro Val Thr Met
                565                 570                 575

Gly Phe Tyr Val Val Lys Asn Val Asn Ala Leu Ala Cys Arg Tyr Arg
            580                 585                 590